<i>Rich Languages from Poor Inputs</i>

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Knowledge-poor Approach to Constructing Word Frequency Lists, with Examples from Romance Languages

Word frequency lists extracted from documents are widely used in many procedures of text clustering and categorization. Usually for compilation of such lists morphological-based approaches (such as the Porter stemmer) to join the words having the same base meaning are used. However such an approach needs many language-dependent linguistic resources or knowledge when working with multilingual da...

متن کامل

Knowledge-poor Approach to Constructing Word Frequency Lists, with Example from Romance Languages

Word frequency lists extracted from documents are widely used in many procedures of text clustering and categorization. Usually for compilation of such lists morphological-based approaches (such as the Porter stemmer) to join the words having the same base meaning are used. However such an approach needs many language-dependent linguistic resources or knowledge when working with multilingual da...

متن کامل

Improved Statistical Machine Translation for Resource-Poor Languages Using Related Resource-Rich Languages

We propose a novel language-independent approach for improving statistical machine translation for resource-poor languages by exploiting their similarity to resource-rich ones. More precisely, we improve the translation from a resourcepoor source language X1 into a resourcerich language Y given a bi-text containing a limited number of parallel sentences for X1-Y and a larger bi-text for X2-Y fo...

متن کامل

Building Text-to-Speech Systems for Resource Poor Languages

The focus of this research is to develop a method for building Text to Speech Systems for resource poor languages by using data from other languages to fine tune a general template polyglot TTS architecture. Our method involves three main componants: language clustering, phoneme mappings and prosody modelling. As a proof of concept, four TTS have been implemented for English, Spanish, Malay and...

متن کامل

Contrastive Learning of Emoji-based Representations for Resource-Poor Languages

The introduction of emojis (or emoticons) in social media platforms has given the users an increased potential for expression. We propose a novel method called Classification of Emojis using Siamese Network Architecture (CESNA) to learn emoji-based representations of resource-poor languages by jointly training them with resource-rich languages using a siamese network. CESNA model consists of tw...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ENGLISH LINGUISTICS

سال: 2017

ISSN: 0918-3701,1884-3107

DOI: 10.9793/elsj.33.2_616